Safe Active Dynamics Learning and Control: A Sequential Exploration–Exploitation Framework

نویسندگان

چکیده

Safe deployment of autonomous robots in diverse scenarios requires agents that are capable efficiently adapting to new environments while satisfying constraints. In this article, we propose a practical and theoretically justified approach maintain safety the presence dynamics uncertainty. Our leverages Bayesian meta-learning with last-layer adaptation. The expressiveness neural-network features trained offline, paired efficient online adaptation, enables derivation tight confidence sets, which contract around true as model adapts online. We exploit these sets plan trajectories guarantee system. handles problems high uncertainty, where reaching goal safely is potentially initially infeasible, by first exploring gather data reduce before autonomously xmlns:xlink="http://www.w3.org/1999/xlink">exploiting acquired information perform task. Under reasonable assumptions, prove our framework guarantees high-probability satisfaction all constraints at times jointly, i.e., over total task duration. This theoretical analysis also motivates two regularizers models improve adaptation capabilities well performance reducing size sets. extensively demonstrate simulation on hardware.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Comparing Bandwidth and Self-control Modeling on Learning a Sequential Timing Task

Modeling is a process which the observer sees another person's behavior and adapts his/her behavior with that which is the result of interaction. The aim of present study was to investigate and compare effectiveness of bandwidth modeling and self-control modeling on performance and learning of a sequential timing task. So two groups of bandwidth and self-control were compared. The task was pres...

متن کامل

A Learning Theory Framework for Association Rules and Sequential Events A Learning Theory Framework for Association Rules and Sequential Events

We present a framework and generalization analysis for the use of association rules in the setting of supervised learning. We are specifically interested in a sequential event prediction problem where data are revealed one by one, and the goal is to determine what will next be revealed. In the context of this problem, algorithms based on association rules have a distinct advantage over classica...

متن کامل

ORCCAD, a framework for safe robot control design and implementation

Robotic systems are typical examples of hybrid systems where continuous time aspects, related to control laws, must be carefully merged with discrete-time aspects related to control switches and exception handling. These two aspects interact in real-time to ensure an efficient nominal behaviour of the system together with safe and graceful degradation otherwise. In a mixed synchronous/asynchron...

متن کامل

the relationship between locus of control and iranian efl university students’ beliefs about language learning

this exploratory study aimed to investigate a possible relationship between learners’ beliefs about language learning and one of their personality traits; that is,locus of control (loc). both variables, beliefs and locus of control, are assumed to influence the language learning process. the internal control index (ici) and the beliefs about language learning inventory (balli) were administered...

JCLAL: A Java Framework for Active Learning

Active Learning has become an important area of research owing to the increasing number of real-world problems which contain labelled and unlabelled examples at the same time. JCLAL is a Java Class Library for Active Learning which has an architecture that follows strong principles of object-oriented design. It is easy to use, and it allows the developers to adapt, modify and extend the framewo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Robotics

سال: 2022

ISSN: ['1552-3098', '1941-0468', '1546-1904']

DOI: https://doi.org/10.1109/tro.2022.3154715